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FOREWORD 

The report that follows was prepared by Dr. Paul Horst, Professor 
Emeritus of Psychology, University of Washington, as an outgrowth of problems 
encountered by the University of Hawaii Center foi Research in Early Child- 
hood Education in the course of early attempts to interpret factors 
represented in scores on the Gumpgookles t£St. This is an instrument, 
developed in part through support from the Office of Economic Opportunity, 
which is composed of dichotomous items designed to tap components of 
motivation to achieve in young children. It had been found that factors 
identified through ordinary factor-analytic techniques were confounded by 
response sets of the subjects, notably those associated with position of 
the keyed answer on the page or section of a page presented to the respondent 
and with whether the keyed answer was presented first or second. Hence 
assistance was solicited from Dr. in devising a method of extracting 

factors with response set or "item characteristic" scores partialled out, 
so that the resulting factor scores would be independent of response sets. 

He ha9 worked with the Center as Consultant on this problem, along with 
others. Some of the later aspects of his work on the general problem that 
had been posed have been supported by a grant to The University of Washington 
from the United States Public Health Service. 

In addition to the authors of the test, Dr. Dorothy C. AJkins and 
Dr, Bonnie L. Ballif of Fordhara University, who was also serving the Hawaii 
Center as Consultant, Mr. Renato Espinosa and Mr. Robert Bloedon contributed 
to the project, particularly in development of appropriate computer 
techniques . 
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The success of the approach described by Dr. Horst is attested to in 
another part of the report of the Hawaii Center, n A Wew Approach to 
Response Sets in Analysis of a Test of Motivation To Achieve, 11 by Dorothy 
C. Adkins and Bonnie L . fiallif. In the study reported therein, it was 
empirically verified that the method does yield factor scores that are 
uncorrelated with response set scores, that the factors are characterized 
by KR-20 reliability coefficients greater than zero, and chat the factors 
can be at least tentatively identified. 
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ABSTRACT 



A method is proposed for defining objectively irrelevant item response 
tendencies* A procedure is developed for deriving independent factor scores 
from a set of items in which scores are also independent of irrelevant 
item trait responses. 
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FACTOR SCORES INDErENPENT OF ITEM TRAITS 



I. The Problem 



Much attention has been given to the influence of those characteristics 
of stimulus elements (items, for example) on responses of individuals, aside 
from the actual purported item content* These characteristics may be social 
desirability, acquiescence, polarity of statement (do, do not), preference 
value, serial position, attributes of response categories (primacy, recency), 
etc. The literature in this area is too extensive to make even illustrative 
references feasible here. Perhaps now well structured procedures should be 
developed for the management of extraneous i^em characteristics in the 
evaluation of personality variables. Such an enterprise should probably 
proceed as follows: 

1. Exhaustive identification of those stimulus element characteristics 
which are independent of item content and to which persons may respond dif- 
ferentially. 

2. The definition of the elements of an item characteristic (IC) 

vector such that, when a person's response vector is multiplied by the IC 
vector, one may rationally regard such a product as the person's score for 
that item characteristic. For example. In the case of dichotomous Items, 
an item preference vector would consist of the preference or 'V 1 values of 
the items. The general approach might be as follows: A response matrix 

for a sample of entities is defined with rows <vs entities* An IC matrix 
conformable to the responre matrix on the right has a column corresponding 
to each defined item characteristic. The IC matrix is defined so that the 
product of the response matrix by the 1C matrix yields a matrix that may 
logically be regarded as a matrix of IC scores. Definition of the response 




matrix will depend on the structure of response option patterns and on 
keying procedures. 

tn a^y case, it is probably most appropriate to investigate response 
styles or other extraneous item characteristics only after having opera- 
tionally and computationally defined the corresponding item characteristic 
scores . 

3. Having defined item characteristic scores, one may then develop 
models for utilising these together with item scores in various ways* For 
example: One may partial out the IC variables from the item variables. One 

may include them with item or scale scores in factor analyses. One nay 
utilize them with or without item or scale scores in estimating criterion 
measures. In general, one. is free to utilize the IC variables in any way 
that other variables can be used in multivariate analysis models. 

The crucial consideration is that an entity response vector end an 
item charncterist ic vector each be defined in such a way that their minor 
product constitutes an acceptable definition of an item characteristic score. 

This general problem of achieving from dichotomous items factor scores 
that are independent of particular response sets or item characteristic 
scores was thrust upon the attention of the writer by Dorothy C, Adkins and 
Bonnie L. Ballif in connection with a project of the Center for Research in 
Early Childhood Education at the University of Hawaii. In their early factor 
analyses of items in th„. ** Gumpgookies , a test of motivation to achieve in 
school for young children, attempts to interpret factors were plagued by an 
apparent contamination cf the factors by two types of response sets affecting 
the dichotomous items: position of the keyed answer to an item--right or 

left--and the order In which the keyed answer to an item was presented to 
the subject-- first or second. The methods described herein art an outgrowth 
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of their desire for factors based upon an item intercorrelation matrix with 
response set scores partialled out- They have been applying the method 
with every indication of success (Adkins and Ballif) . 

11, The Rationale 

Let 

n be the number of entities 
e 

n be the number of items 
a 

be the number of item traits 
be the number of factors 

X be an (n x n ) binary item score matrix 
ea e a 

X be an (n^ x n ) item trait matrix 
at at 

M be an item mean vector 
a 

D be an item standard deviation matrix 
a 

x be a Deviation item score matrix 
ea 
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(1) 



We define an (n e x n t > trait score matrix by 



et 



X ea X at 



From (1) we see that a trait score is defined as the scalar product of an 
entity's item score vector by an item trait vector. 

We consider now the general problem of calculating item factor scores 
which are independent of item trait scores* To do this, we first consider 
the problem of calculating item scores which are independent of item trait 
scores. We let x et be an (n £ x n t > deviation matrix of item trait scores. 
It can readily be shown from the definitions and equation 1 that 



x et = x ea X at * 



( 2 ) 



Let 



C aa be an n a 'th order matrix of covariances of item scores; 
C at be an {n a x n t ) matrix of covariances of item scores with 
item trait scores ; 

C tt be an (n t x n t ) matrix of trait score covariances; and 
u ea an (n e x n a ) matrix of item scores independent of item 
trait scores. 



Reversing subscripts to indicate transposition, we have from the definitions 



C aa 


3 

ae 


x ea 


/ 


n e 5 


(3) 


C a, 


a x ae 


x et 


/ 


n e : 


(4) 


C tt 


" x te 


x et 


i 


°e * 


(5) 



We now write 

u ea * x ea * x et B ta > 



( 6 ) 



where B ta is a matrix determined so that u ea is independent of the 
matrix of item trait scores x et , or such that 

O x te u ea " 0 * _ < 7 
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From (4) through (7) 



0 c ta " c tt B ta 



( 3 > 
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And from (8) 

B ta " C tt' c ta • <9) 

Equation (9) is the well known expression for the matrix of mult ^ple 
regression constants calculated from deviation measures. 

From (2) and (6), 



Let 



u ea " x *a « ’ x at B ta> • 



J aa u ae u ea ^ tl e » 



( 10 ) 



( 11 ) 



where G aa is now the covariance matrix of item scores independent of the 
item trait scores. 



From (3), (4), (5), (6). (9), and (11) we have 



G aa = c aa ~ c at c tt c ta 



Also from (2), (3), and (4), 



C a£ = C aa X at . 



and frbm (9) and (12) 



^aa "■ ^aa * ^at ^ta 



( 12 ) 



( 13 ) 



(1A) 



The matrix G aa in (14) is a covariance matrix. To get factor scores 
independent of item trait scores, we consider first the correlation matrix 
derived from G a<? . We let 

D c = diag (G aa ) . (15) 

Then the partial correlation matrix of factor scores with item scores 
partialed out is 

(16) 

We next consider a factor analysis of R aa . First we consider the basic 
structure or principal axes solution for n f factors. We let A f be the 
first n f principal axes factors of R Qa . The method utilized will be 
the basic structure successive factor method (Horst, 1965, p. 160), 
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R aa y c G aa D c 



We let 




A£ = Q£ ^£ , ( 1 ^) 

where Is a diagonal matrix of the r>£ largest eigenvalues of R aa , 

and Qf is a matrix of the corresponding eigenvectors. 

Next we solve for the varimax transformation of the Af matrix by the 
simultaneous factor varimax solution (Horst, 1965, p. 423). This solution 
also incorporates the algorithms used in the principal axes solutions, since 
the method requires a series of basic structure type solutions. We indicate 
this solution by 



B * A f H , (18) 

where H is the square orthonormal transformation which satisfies the 
varimax criterion. 

It should be noted that the varimax criterion is independent of the 
signs of the varimax factor loading matrix. For this reason it may often 
be necessary to determine both row and columi, sign changes for a varimax 
factor loading solution in order to maximize the number of large positive 
elements. For some factor analytic solutions of item variables, it may be 
that some items were not keyed to give the optimal positive manifold. 
Suppose then we let 



b = i L B i R . (19) 

where i^ is a sign matrix operating on rows of B and i R operates on 
columns. As a first approximation we give i^ the signs of the corre- 
sponding elements in A ^ . the first principal axes factor loading vector. 
This we may indicate by 

= sign (D a > , (20) 



We let 



i b - ih B • 



We then take as i R the signs of the column sums of 
2 b ' 1*L B l R ' 



m) 

in (21) and write 
( 12 ) 



5 



10 



Next from ^b we make up a sign matrix fvom the signs of the largest 



absolute values in the rows of ^b of (22) # We then let 
l L = l l L 2 i L • 



(23) 



Hence for the b matrix of (19) the largest element of each row will be 
positive. In general, it can also be expected that the largest element of 
each column will be positive. 

It should be observed that the problem of determining the optima! sign- 
changing matrices i and i for varimax factor loading matrices has been 
L K. 

largely ignored by most investigators, and that this problem is not restric- 
ted to the case where factor scores independent of item trait scores are 
sought . 

To define the trait-free factor score matrix, we return to equation (10), 
Here u ga is an item score matrix independent of the item trait scores, as 
indicated by equation (7). But the scores in this matrix are not standard- 
ized. Therefore we let 



v ea = ll ea D c 



From ('ll), (15), and (16), we see that 
v v / n - II 



ea 



v aa 1 



(24) 



(25) 



and therefore v ea is a matrix of standard measures. 

We now wish to find a matrix of trait-free factor scores y e £ which 
enables us best to approximate v ea in the least squares sense. The 
expropriate model for this solution may be written 

v ea l L ' Vef b ’ = E • < 26 > 

where of course v ea and b are known and the i^ appears on the left 
of (26) to conform to i^ in (19). 
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The least squares solution for y 



In (26) is well known to be 



ef 



y.t ■ \a *L b <*' b) ' 1 • 

From (17), (18), (19), and (27), we get 



\i = v ea ** H l R * 

But from (17) acid (28) we may write 



-2 



y ef = v ea A af A H J R 



If we let 



C ° 4' 2 H i R 

and 

8 = AC , 

from (29) and (31), 

y ef a v ea P • 
From (24) and (32), 
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Let 



y , = u D 8 . 
■'ef ea c 



B af = D c B • 



From (33), (34), and (10), 



y ef “ x ea U " X at B ta^ B af * 



Let 



B tf c B ta B af • 

From (35) and (36), 

y ef “ *ea (B af ' X at B tf> * 
Finally, let 

Baf = B af * x at B tf • 

From (38) 

y ef “ x ea B «f * 
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(77) 

(28) 

(29) 

(30) 

(31) 

(32) 

(33) 

(34) 

(35) 

(36) 

(37) 
(36) 
(39) 



Now by definition 
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^ea ea 
From (39) and (40) 



l M g ' 



'ef 



X ea ®af " 1 V 6 af * 



It can be proved now that 



and 



y £e y ef 7 "e = 7 



y fe X et = 0 



(40) 



(41) 



(42) 



(43) 



or that the factor scores y are uncorrelated and of unit variances, and 
also uncortelated with the itjm trait scores* 

Suppose we wish to transform the factor scores so that they have mean? 



of a r and standard deviations of 8 r 



We may write 



Let 



and 



Y ef = Y ef S C + 1 11 a c 



B * X J*> - s 
af af c 



V r 
a 



a 



- M • B 



c a 

From (43) through (46), 



af • 



ef 



X B * + 1 V 1 • 
ea a f a 



(44) 



(45) 



(46) 



(47) 



Suppose now ve wish to construct an integer scoring matrix E^ from 

such that the largest absolute value in each column of E * is c. , We 

af 1 

let D be a diagonal matrix of the largest absolute values in the columns 
Of B af , 

calculate 

B af X (B af D l) (C 1 + ,999) * (48) 

and take the integer function of B - ; thus 

3 L 

E af " lnt (B af> * < 49 > 

The maximum absolute value in each column of E - will now be Ci . To 

at i 

further simplify the scoring, we may for any integer greater than 1 

let Ej. a ign for f* 0. With this procedure, the numbe: of 
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items discarded because of all 0 elements in a row of E c decreases 

at 

as increases. 

For the integer scoring method for dichotomous items, one may reverse 
the keying of all items with negative scoring weights so that all weights 
become positive* 

The interpretation of the negative scoring weights is that the items 
with negative weights for a factor suppress unwanted variance of other 
factors which contaminate the items with positive weights for the factor. 




III. Computational Procedure 

Ihe computational procedure for the foregoing rationale will now be 
described. The symbol x is read T the expression on the left is replaced 
by the expression on the right, Unless otherwise indicated, subscripts 
refer to order of matrices and reversal of subscripts means transposition 
of the matrix. Exponents in parentheses mean elemental exponentiation. 

Calculate as a minor product of type II vectors (Horst, 1963, p. 143) 



C aa c X ae X ea * 

H a = X ae 1 ■ 

S = X < 2 > 1 . 
D re 



(50) 

(51) 

(52) 



Calculate the Item means (M a ) and item standard deviation vectors: 



H a X / a e , 

S D X 5 D ' °e ■ 

S D ~ (S D ' M a (2> > <%> ' 

Calculate the item covariance matrix: 



C aa x C aa > "e * M a ’ 



(53) 

(54) 

(55) 

(56) 



Calculate the item-by- trait covariance matrix as the major product of 
type II vectors (Horst, 1963, p. 144): 



C 



at 



Calculate 



type II vectors 



c aa *at • (57) 

the trait-by- trait covariance mat lx as a major product of 
(Horst » 1963, p. 144): 



"tt 



X ta C at 
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(58) 



Calculate the inverse of a symmetric matrix (Horst, 1963, p. 461): 
-1 



c tt 1 c tt • 



(59) 



Calculate the regression matrix for estimating the item score matrix 
from the item trait scores: 



B at = c at C tt * 



( 60 ) 



Calculate the covariance matrix of item scores with trait scores 
partialed out: 



^aa X “ ^at ^ta • 



Le t 



D c - diag (C aa ) . 



Calculate 



X D 
c c 






(61) 



( 62 ) 



(63) 



Calculate the item correlation matrix with trait scores partialed out 



C aa ~ D c C aa D c 



(64) 



Calculate the first n£ principal axes factor loading vectors of 
C aa (Horst, 1965, p, 160): 

A - Q f A f . (65) 

Calculate the varimax transformation matrix H (Horst, 1965, p. 428) 
iteratively as follows: 

B ° A H , (66) 

D = diag (B' B) , (67) 

D X D / n a , (68) 

8 Z B* 3 * - B D , (69) 

C = A' B . (70) 

The basic structure of C (Horst, 1965, p. 437) is i 

pdq' »C (71) 

and 

H - p q' . (72) 

Continue (66) through (72) until H stabilizes, beginning with H * I . 
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Calculate sign matrices for S a and Sf , as indicated in equations 



(19) through (23) ( S a = i L ( S f = i R ). 

Calculate the optimal sign varimax factor loading matrix 

B I S a B S f . (73) 

Calculate the varitnax regression matrix as follows! 



K X 


H S f , 


(74) 


K X 


a' 2 H , 


(75) 


B = 


A H , 


(76) 


B X 


D c B , 


(77) 


B tf 


= B ta B > 


(78) 


B I 


b * X at B tf * 


(79) 


B X 


B S c , 


(80) 


V 


= 1' a c M a ' B . 


(81) 



Calculate E as an integer matrix from B as indicated in equations 
(48) and (49). 



Calculate as minor products of type II vectors (Horst, 1963, p, 143) 
the exact factor scores by 

(32) 

the integer weight factor scores by 



y = X B • 1 V 1 , 
B ea a 



y E = X « E > 

and the item trait scores by 

y t = X ae X at • 
Letting 

y = Cy B . y E . y t > , 



(83) 



( 84 ) 
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(35) 



calculate the correlations among the three sets of y scores in (85) as 
follows : 



Calculate as a minor product Type II vector; 

c aa = y' y , (86) 

M a - y' 1 , (87) 

S D - y (2)' 1, (88) 

Calculate the means and standard deviations for the y supermatrix 



by 



X M a 1 n e - 



2 : £ n / n. , 



s D x <S D - ».«>><» 



(89) 

(90) 

(91) 



Calculate the correlation supermatrix for the y scores by 

c aa * < C aa " »a V> 1 < S D S d’> • 
where / means elemental division. 

We may now define 





r bb 


r be 


R Bt 


C aa = 


r eb 


"ee 


R Et 




R tB 


R tE 


R tt 



Ihen we .should have 

r bb - 1 • 

V 1 0 > 



(92) 



(93) 



(94) 

(95) 



and . Rg E and Rg £ should approximate identity matrices while R fi should 
approximate a null matrix. 




IV. Kuder -Richardson Factor Score Reliability 
We may estimate the Kuder -Richardson reliability for the y 
scores for the J'th factor by 
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( 96 ) 



r . 
J 



(n 3 - 1) S c ‘ 



<S, 



- B> 



Bj ) 



where Bj is the j'th column vector of the exact varimax weighting 
matrix ,’hich yields factor rcores with standard deviations of $ c , and 
D g is a diagonal matrix of item standard deviations. This formula is 
based on the assumption that the average item retest covariance is equal 
to the average in Car -item covariance. Presumably, therefore, it may 
markedly underestimate the retest reliability of a factor score* 
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